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CENTRAL LIMIT THEOREMS FOR AN 
INDIAN BUFFET MODEL WITH RANDOM WEIGHTS 

PATRIZIA BERTI, IRENE CRIMALDI, LUCA PRATELLI, AND PIETRO RIGO 



Abstract. The three-parameter Indian buffet process is generaUzed. The 
possibly different role played by customers is taken into account by suitable 
(random) weights. Various limit theorems are also proved for such generalized 
Indian buffet process. Let Ln be the number of dishes experimented by the first 
n customers, and let K„ = (1/n) 5I]"=i ^i where Ki is the number of dishes 
tried by customer i. The asymptotic distributions of L„ and Kn, suitably 
centered and scaled, are obtained. The convergence turns out to be stable 
(and not only in distribution). As a particular case, the results apply to the 
standard (i.e., non generalized) Indian buffet process. 



1. Introduction 



Let {X,B) be a measurable space. Think of A" as a collection of features po- 
tentially shared by an object. Such an object is assumed to have a finite number 
of features only and is identified with the features it possesses. To investigate the 
vQ , object, thus, we focus on the finite subsets of X. 

^SJ ' Each finite subset B d X can be associated to the measure fiB = ^xeB^^^ 

^O , where /i0 = and d^ denotes the point mass at x. If B is random, fig is random 

^n ■ as well. In fact, letting F = {/is : B finite}, there is a growing literature focusing 

Tlj" , on those random measures M satisfying M ^ F a.s.. (Measurability of the set 

^D ' {M e F} can be proved under mild assumptions on {X, B)). 

A remarkable example is the Indian Buffet Process (IBP) introduced by Griffiths 
and Ghahramani and developed by Thibaux and Jordan; see [13], [14], [23]. The 
objects are the customers which sequentially enter an infinite buffet X and the 
features are the dishes tasted by each customer. In this framework, each customer 
""{Jj [ is modeled by a (completely) random measure M such that M £ F a.s.. The atoms 

Cu ' of M represent the dishes experimented by the customer. 

Our starting point is a three-parameter extension of IBP, introduced in [7] and 
[22) to obtain power-law behavior. From now on, such an extension is referred to 
as the standard IBP. The dynamics is as follows. Let a, (3 and c be real numbers 
such that a>0, 0</3<l and c > — /3. Customer 1 tries Poi(a) dishes. For each 
n > 1, let Sn denote the collection of dishes experimented by the first n customers. 
Then: 

— Customer n+1 selects a subset S^ C Sn- Each a; e S'„ is included or not into 
5* independently of the other members of Sn- The inclusion probability 
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is "^ "+K where m{n,x) is the number of previous customers who tried 
dish X. 
— In addition to S*, customer n + 1 also tries Poi(a„_|_i) new dishes, where 

_ r(c+l)r(c+/3+n) 
""+1 ~ "^ r(c+/3)r(c+l+n) ■ 

For /3 = 0, such a model reduces to the original IBP of [13], [M], [23] , 

IBP is a flexible tool, able to capture the dynamics of various real problems. 
Applications of IBP range from Bayesian nonparametrics J16j and causal inference 
[25] to modeling of choices [H], similarity judgements [30] and dyadic data [15] . 

Despite its prominent role, however, the asymptotics of IBP is largely neglected. 
To the best of our knowledge, the only known fact is the a.s. behavior of L„ (defined 
below) and some other related quantities for large n; see (Yj and ^. Nothing is 
known as regards limiting distributions. 

This paper aims to do two things. 

First, to generalize the standard IBP in order to account for the different impor- 
tance of customers. To this end, customer n is attached a random weight i?„. For 
each X € Sn, the inclusion probability becomes "c+y" R where Mi{x} is the 
indicator of the event {customer i selects dish x}. If i?„ = 1 for all n, the model 
reduces to the standard IBP. See [2], [3], [4], [5], [6], [21] for analogous models in 
different settings, and [10], [11], [H], [21] for other IBP-type models. 

Second, to investigate the asymptotics of the previous generalized IBP model. 
We focus on 

Ln = number of dishes experimented by the first n customers, and 

— 1 " 

Kn = — 7 Ki where Ki — number of dishes tried by customer i. 

i=l 

Three results are obtained. Define an{P) — \ogn if /3 = and a„(/3) = n^ if 
/? G (0, 1). Then, under reasonable conditions on the weights i?„, it is shown that: 

(i) ^^fg) — > A, where A is a certain constant depending on a, fi and c; 
(ii) VM^ { ^ - ^} ^ -^(0. ^) stably; 

(iii) Kn ^-4 Z and ^/n{^Kn — Z] — > A/'(0, a^) stably, where Z and a^ are 
suitable random variables. 

Stable convergence is a strong form of convergence in distribution. The basic defi- 
nition is recalled in Subsection 12.31 

Among other things, the above results can be useful to make (asymptotic) infer- 
ence on the model. As an example, (i) implies that 

^ _ logLn 
Pn — , 

iogn 

is a strongly consistent estimator of the parameter /3. In turn, (ii) provides the 
limiting distribution of /?„ so that simple tests on /3 can be manufactured. Similarly, 
(iii) can be exploited to get asymptotic confidence bounds for the random limit Z 
oi Kn] see Subsection l5.ll 
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A last remark is that (i)-(ii)-(iii) hold in particular when i?„ ~ 1 for all n. Thus, 
(ii) and (iii) provide the limiting distributions of i„ and K^ in the standard IBP 
model. 

2. Preliminaries 

2.1. Basic notation. Throughout, A" is a separable metric space and B the Borel 
cr- field on X. We let 

M. ~ \\x : [i Ya & finite positive measure on B\ 

and we say that /i G Al is diffuse in case n{x} ~ for all x (z X. 

All random variables appearing in this paper, unless otherwise stated, are defined 
on a fixed probability space {n,,A,P). li G C A is a. sub-cr- field and X and Y are 
random variables with values in the same measurable space, we write 

x\g^Y\g 

to mean that P{X E A \ Q) = P{Y E A \ Q) a.s. for each measurable set A. 

Finally, Poi(A) is the Poisson distribution with mean A and Af{0, A) the Gaussian 
distribution with mean and variance A. Here, A > and Poi(O) = Af{0, 0) = Sq. 

2.2. Random measures. A random measure (r.m.) is a map M : fl ^ Jvl such 
that oj I—)- M{uj){B) is ^-measurable for each B G B. In the sequel, we write M{B) 
to denote the real random variable w i— >■ Ad{ijj){B). Similarly, if / : A" — > M is a 
bounded measurable function, M(f) stands for 

Miu;){f) = J fix)Miu;)idx). 

A completely r.m. is a r.m. M such that M{Bi), . . . , M{Bk) are independent 
random variables whenever i?i, . . . , Bk G B are pairwise disjoint. 

Let z^ G A^. A Poisson r.m. with intensity v ]s a, completely r.m. M such that 
M{B) ~ Y'oi{v{B)) for aU B E B. Note that M{B) = a.s. in case v{B) = 0. 
Note also that the intensity v has been requested to be a finite measure (and not a 
cr- finite measure as it usually happens). 

We refer to chapter VI of [8] for Poisson r.m.'s. We just note that a Poisson 
r.m. with intensity v is easily obtained. It suffices to let M = if v(X^ ~ 0, and 
otherwise 

N 
M = I{N>0} X! '^^J ' 

where (Xj) is an i.i.d. sequence of ^Y-valued random variables with Xi ^ vlv[X), 
N is independent of {Xj) and N ^ Poi(z^(A')). 

As explained in Section [TJ in this paper we focus on those r.m.'s M satisfying 
M E F a.s., where F — {^b ■ B finite} and fiB = J2xeB ^x- ^^ ^'^ ^^ ^ Poisson r.m. 
with intensity v, then M E F a.s. if and only if i/ is diffuse. Therefore, another 
class of r.m.'s is to be introduced. 

Each ly E Ai can be uniquely written as ly ^ h'c + i^d, where Vc is diffuse and 

3 

for some jj > and Xj E X. (The case Vd = corresponds to -fj ~ for all j). Say 
that M is a Bernoulli r.m. with hazard measure u, where i^ G A^, if 
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• M = Ml + M2 with Ml and A/2 independent r.m.'s; 

• Ml is a Poisson r.m. with intensity Vc] 

• M2 — X^i ^ ^xj where the Vj are independent indicators satisfying 

Some (obvious) consequences of the definition are the following. 

— M — Ml a.s. \i V — Vc and M — M^ a.s. \i v — Vd] 

— M is a completely r.m.; 

— M e F a.s.. 
We will write 

M - BeP{v) 
to mean that Af is a Bernoulli r.m. with hazard measure v. 

2.3. Stable convergence. Stable convergence is a strong form of convergence in 
distribution. We just recall the basic definition and we refer to [9], [15] and refer- 
ences therein for more information. 

A r.m. K such that K{ijj){X) — 1, for all w e 51, is said to be a kernel or a 
random probability measure. Let K he a kernel and (Xn) a sequence of A'-valued 
random variables. Say that Xn converges stably to K if 

E{K{})\H)^\iniE{!{Xn)\H) 

for a\\ H ^ A with P{H) > and all bounded continuous / : A" — > M. (Recall that 
A denotes the basic cr-field on il). For H = fl, stable convergence trivially implies 
convergence in distribution. 

3. The model 

Let (Mji : n > 1) be a sequence of r.m.'s and (i?„ : n > 1) a sequence of real 
random variables. The probability distribution of ((A/„,_R„) : n > 1) is identified 
by the parameters m, a, l3 and c as follows. 

• m is a diffuse probability measure on B; 

• a, f3, c are real numbers such that a > 0, /3 < 1 and c > — /3; 

• i?„ > u > /3+ , for some constant u, and i?„ independent of {Mi, . . . , M„, i?i, . . . , i?„ 
for each n > 1; 

• Mn+i I J-n ^ BeP{vn) for all n > 0, where 

J"o = {0,ri}, VQ = am, J"„ = cr(Mi,...,M„,i?i,...,i?„), 

Er=i ^^Af.M - /3 ^ , r(c + i)r(c + /3 + ELi^0 






,,s„ Y:=iRi + c r(c + /3)r(c + i + E:Lii?.) 

and 5„ = {a; G A" : Mi{x} = 1 for some i = 1, . . . , n}. 

Our model is the sequence ((Af„, i?„) : n > 1). It reduces to the standard IBP in 
case i?„ = 1 for all n and /3 G [0, 1). Note that Mi is a Poisson r.m. with intensity 
am. Note also that A'f„ G F a.s. for all n > 1, so that 

n 

Sn = [J Support(Mi) a.s.. 
1=1 
As remarked in Section [H _R„ should be regarded as the weight of customer n. 
Thus, the possibly different role played by each customer can be taken into account. 
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Instead, m, a, /3 and c have essentially the same meaning as in the standard IBP 
model (apart from /3 may be negative). Precisely, a is a mass parameter controlling 
the total number of tried dishes, c is a concentration parameter controlling the 
number of customers which try each dish, and /3 is a stability exponent tuning 
the power-law behavior of the process (as far as /? > 0) . The probability measure 
m allows to draw, at each step n > 1, an i.i.d. sample of new dishes. In fact, 
m(^X \ Sn) = 1 a.s. for m is diffuse and Sn finite a.s.. 

A r.ni. can be seen as a random variable with values in (M, S), where S is the 
(T-field on Ai generated by the maps /i i— ?► ii{B) for all B E B. In the standard 
IBP case, (Af„) is an exchangeable sequence of random variables. Now, because 
of the Rn, exchangeability is generally lost. In fact, the same phenomenon (loss 
of exchangeability) occurs in various other extensions of IBP; see [10], [11], [19j . 
[24j . However, for /3 = 0, (M„) is conditionally identically distributed (c.i.d.) with 
respect to the filtration 

Go = {0, n}, Gn = Tn V <T(i?„+l) = a{Mi, . . . , M„, i?i, . . . , i?„, i?„+i). 

This means that 

Mk I Gn ^ Mn+1 I Gji for all fc > n > 0. 

Among other things, Mk ~ Mi for all fc > 1 whenever (A/„) is c.i.d.. 

C.i.d. sequences need not be exchangeable, but their asymptotic behavior is 
rather close to that of exchangeable ones. We refer to 4 for more on c.i.d. se- 
quences. Here, we make the conditions for (M„) to be c.i.d. precise. Define 

. , . T{c+l)^ic + /3 + J:t,R^) 
An — a and A„ = a — ; ^ — ■ "„ r • 

r(c-f/3)r(c + i + ELi^0 

Lemma 1. (M„) is c.i.d. with respect to (Gn) if o,nd only if 

Rn+i — P 



(1) A„+i - A„ (1 - 

Proof. We just give a sketch of the proof. First note that, since i?„+i is independent 

of (Afi, . . . , M„, Af„+i, i?i, . . . , i?„), then 

P{M,,+i e • I Gn) = P(M„+i e • I J-„) a.s.. 
Further, (Af„) is c.i.d. with respect to (Gn) if and only if 

(2) M„+2iB) \ Gn - Mn+iiB) \ Gn for each n > and B € B. 

In fact, conditionally on Gm the r.m.'s Mn+i and Mn+2 are both completely r.m.'s 
(we omit the proof of this fact). Hence, condition ([2]) amounts to 

Mn+2 I Gn ^ Mn+1 \ Gn for all n > 0. 

Such condition is equivalent to Mk \ Gn ^ M„+i \ Gn for all fc > ?i > 0, as it can be 
shown by a simple induction on fc. 

We next prove that ([T]) <=> ([2]). Fix n > and B E B. It can be assumed 
m{B) > 0. For t e M, one obtains 






exp(m(i3)(e*-l)A„) J] {l + (^^ - 1) ^t + fi^^'^''^ } ""■'■ 

xeSnDB 
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where the second equality is because Mn+i \ Tn ^ BeP(yn). Similarly, 

= exp(™(i?)(e*-l)A„+i)i?{ n (l + (^^'-l) '^^^H+^'f^"^ )|g4 
Finally, it can be shown that 



c + ES'-R. 






a:eS„+inS 



= exp(m(S)(e -1)A„— — ^^ 11 |l + (^-l) c + E" i i?. / 

Again, we omit the calculations for this last equality. Therefore, condition ([T]) 
amounts to ^{e**-f"+2(^) | 5„} = ^{e*^"+i('^) | g„} a.s. for each i e R. D 

It is straightforward to verify that condition ([T]) holds if /3 = or if _R„ = 1 for 
all n. 

4. Asymptotic behavior of L„ 

Let A^i be the number of new dishes tried by customer i, i.e., 

N^ = card(S', \ 5,_i) with 5o = 0. 

Note that A^^ is J-^-measurable and Ni \ J^i_i ^ Poi(A,j_i). 
This section is devoted to 

n 

Ln = card(5„) ^^N,, 

the number of dishes experimented by the first n customers. Our main tool is the 
following technical lemma. 

Lemma 2. There is a function h : [c, oo) — > R such that 

r(c + i) i + /i(c + ELi^ 

■^ cxj una ii„ = Li 
In particular. 



swplx h(x)\ < cx) and A,, = a — -, ^^ — -, — 77- for all n> 1. 



An < ^^_p and |A„+i - A„| < ^_^ foralln>l, 



const I const 

—r—T and |A„+i - A„ < - 
n'- P n 

where "const" denotes a suitable (non random) constant. 

Proof. Just note that p(^_r^> = x^~^{l + h{x)), with h as required, for all x > c; 

see e.g. formula (6.1.47) of [Tj. Further, (c + J27=i^i)~^ — (wri)~^ for all n > 1, 
since Ri > u > for all i. D 

A first consequence of Lemma [2] is that, for /3 < 0, customers eventually stop to 
experiment new dishes. 

Lemma 3. P(Ni > j^ infinitely often) = 0. In particular, 

P{N,>0 infinitely often) =0 if [3 < 0. 
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Proof. Fix an integer A; > 1. Since Ni+i \ Fi ^ Poi(Ai), 

P[N,+^ >k)= E{P{N,+, > k I J^O} = E{e-^' E tI " '^■ 

j>k ^' 

ByLemma[l£;(Af) = 0(i-(i-^)'=). Letting fc = l + [l/(l-/3)], since fc (1-/3) > 1, 
one obtains ^^ P{N, > 1/(1 - jS)) = J2^ P{Ni > k) < oo. D 

If /? < 0, Lemma [3] yields L„ —4 J^iLi ^i < °^- If /^ G [0, 1), the a.s. behavior 
of L„ for large n can be determined by a simple martingale argument. Define 



— 1 " 

Rn — — / Ri 






and suppose that 

(3) Rn — ^ ?" for some constant r. 

Since Ri > u for all i, then r > m > 0. Define also 

A(/3) = ^if/3 = and A(/3) = ^^^ ^ if /? G (0, 1), 

a„(/3) = logn if /3 = and a„(/3) = r/ if /3 e (0, 1). 
Theorem 4. If P E [0,1) and condition ^ holds, then 

t^n a.s, 



anW) 



A(/3). 



Proo/. By LemmaH A, = a fg±i| (c + ELi ■^'0'^ ' {l + H^ + ELi ^») } where 
the function h satisfies \h{x)\ < (k/x) for all a; > c and some constant fc. Write 

S->. _ r(c + i) E-^/-Mf + i^.r^ _ 



a. 



.(/3) r(c + /?) a„(/?) 



, ^ r(c + i) E;=i(c + ELi^O'^'M^ + ELii?. 

where D„ = a- 



T{c + (3) a„(/3) 

In view of ^, one easily obtains D„ -^^ and J=,g^ ^ ^^ '^(Z?)- Next, define 



N, - E{N, I J-,_i) ^ ^ iV, - A,„i 



-0=0 and z.=Y. '"-::2^r -]; 

Then, (T„) is a martingale with respect to (Jvi) and 

p.^2^ v M(^.--A,--i)^} f. g{ig((iV,--A,_i)^|^,-_0} _ " i;(A,-_i) 

Since i?(A,) = 0(j-(i-'3)), then sup„ £;(T„2) = J2^^^E1^ < oo. Thus, r„ 
converges a.s., and Kronecker lemma implies 

, Ln , EU^^ r E^LiVi ,. Ao + E;r>, 

iim — — — - — hm — — = lim — - — — — — = hm -^ ~ \(p) a.s.. 

" a„(/3) " a„(/3) " a„(/3) " a„(/3) 

D 
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Theorem H] essentially states that, as far as the weights i?„ meet the SLLN, L„ 
behaves for large n as in the standard IBP model. Note also that, the i?„ being 
independent, a sufficient condition for Q is 



sup-E(-R^) < oo and 



Eti E{R.) 



We next turn to the limiting distribution of i„. To get something, stronger 
conditions on the i?„ are to be requested. 

Theorem 5. Let P e [0, 1). // 



(4) Rn 

for some constant r, then 



— > r ana — 



R,~r 



VaJM 



v/M^ { ^J^) -MP)}^ AA(0, A(/3)) stably. 
Proof. We first prove that 

(5) v/M^{- 



V" A 



a„(/3) 



i-A(/3)} 



0. 



Let /? = 0. Then, by Lemma [2l condition ([5]) is equivalent to 



Y^n-l f 1 ]_\ 



%/logn 



0. 



Since c + X^Li ^« > SLi ^^ ^ i ^' ^^i^^ 



^ 



c + ELiR^ n 



E 



< 



c + ELi^.-^j 



E 



< 



R,-r 



ur j'^ 



ur ]^ 



ur J 



Hence, condition ^ implies i?|yn| — > 0. This proves ([5]) for /3 = 0. The case 
/? e (0, 1) can be handled in exactly the same way. Thus, condition ([5]) holds. 
Next, define 



Un = ^an{P)[ 



£„ E"=iA,-i. E"=i(^.-A,-i) 



Sn(/3) an(/3) 



^/ajj) 



In view of (O, it suffices to show that C/„ — > A/'(0, A(/3)) stably. To this end, for 
n > 1 and j = 1, . . . ^n, define 



Un 



N, - A,_i 



T^n.o — J^o a-nd 7?.„ ,■ — J^, 



Then, E(Un,j \ T^n.j-i) = a.s., Tin,j C TZn+ij and C/„ = X]j t^nj- Thus, by the 
martingale CLT, [/„ — > JV{0, A(/3)) stably provided 

n 

(i) Z1^"J ^ ^(^)' (ii) n^^^'' l^».jl ^ 0' ("i) sup£:{ max [/^ } < oo; 
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see e.g. Theorem 3.2, page 58, of [15]. Let 

Hj = {Nj - Aj_i)2 and !?„ = 



e;=i{^.-s(^.i-^.-i)} 



anW) 

By Kronecker lemma and the same martingale argument used in the proof of The- 
oremlH Z?„ ^^ 0. Since Rj ^—> r and E{Hj \ J^j-i) = ^j-i a.s., then 

This proves condition (i). As to (ii), fix fc > 1 and note that 
max [/„ < =f^- — '- + max — -^ < =f^ — '- + sup — f— for n> k. 

Hence, limsup„maxi<j<„ U^ < limsup„ !gs and condition (ii) follows from 

Hn _ ^j = l J Z^j^l Hj g.s 

a«(;5) a«(;5) an(/3) 

Finally, condition (iii) is an immediate consequence of Lemma [5] and 



£;{ max Ul^} < 

l<7<n -^ 



<J<" "^^J - a«(/3) «„(/?) 



D 



Again, using that (i?„) is an independent sequence, it is not hard to show that 
condition (|4|) is satisfied provided 



sup£;(i?^) < oo and y/n^ \ogn{E{Rn) ~ r} — ^ 0. 

n 

In particular, ((4]) holds in case sup„£'(i?^j) < oo and E{Rn) — r for all n. Note 
also that, letting i?„ — 1 for all n, Theorem [5] provides the limiting distribution of 
Ln in the standard IBP model. 

A last remark is in order. Fix a set B £ B and define 

to be the number of dishes, belonging to B, tried by the first n customers. The same 
arguments used for L„ = Ln{X) apply to Ln{B) and allow to extend Theorems |4][5] 
as follows. 

Theorem 6. Let /? G [0, 1) and B E B. If condition ([3]) holds, then 

^ ^ m(S)A(/3). 
an[P) 

Moreover, under condition (|4]), one obtains 

y/^U^ {^^^ ^ m{B)X{(3)} -^Af{0, m(B)A(/3)) stably. 

Proof. Let Ni{B) denote the number of new dishes, belonging to B, tried by cus- 
tomer i. Then, L„(B) = J2"^iN^{B) and N^+i{B) \ T^ - Poi(m(B) A^). There- 
fore, it suffices to repeat the proofs of Theorems |3][5] with Ni{B) in the place of Ni 
and m{B) Ai in the place of A^. D 
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5. Asymptotic behavior of Kn 

5.1. The result. Let Ki — Mi{X) be the number of dishes experimented by cus- 
tomer i and 

— 1 " 
n ^-^ 

i—l 

the mean number of dishes tried by each of the first n customers. In IBP-type 
models, Kn is a meaningful quantity and one (natural) question concerns its limiting 
distribution. Such question is answered by the next result. 

Theorem 7. Suppose (3 e [0, 1/2) and 

supRn<b, E{Rn)~^r, E{RI) ^ q, 

n 

for some constants b, r, q. Then, 

n 

K^!^Z and - y KJ ^ Q 

for some real random variables Z and Q such that Z^ < Q a.s.. Moreover, 
V^ {Kn - Z] — > 7V(0, cr^) stably, where a^ = ^\^ {Q - Z^). 

Theorem [7] is a consequence of Theorem 1 of |6 . The proof, even if conceptually 
simple, is technically rather hard. 

Theorem [7] fails, as it stands, for (3 e [1/2, 1). As an example, y/n{Kn — Z^ 
does not converge in distribution for /? e (1/2, 1), even if _R„ = r for all n and some 
constant r ^ 1. 

It is worth noting that, since q> r^ and Q > Z^ a.s., the (random) variance a^ 
of the limit kernel satisfies a^ > a.s.. 

Among other things. Theorem [7] can be useful to get asymptotic confidence 
bounds for Z. Define in fact 



,;,{ (2/..)E,.«? _.||l^^._y;|. 

^n i—l 

Since a^ ——> <J^ and a^ > a.s., one obtains 

/{g„>o} ^^ ^ ^AA(0, 1) stably. 

Thus, Kn ± ^ (?n provides an asymptotic confidence interval for Z with (approx- 
imate) level 1 — a, where Ua is such that A/'(0, 1)(uq, oo) — a/2. 

Letting i?„ = 1 for all n. Theorem [7] applies to the standard IBP model. In 
this particular case, moreover. Theorem [7] holds for every /3 € [0,1). In fact, the 
assumption /3 < 1/2 is only needed in checking condition (j) (see the proof below). 
But, in view of Lemmafl] if i?„ = 1 for all n then (M„) is c.i.d. with respect to the 
filtration (Gn)- Thus, (M„) is c.i.d. with respect to (Tn) as well, and condition (j) 
(with Un — J-n) is automatically true. 
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5.2. The proof. We begin with a couple of results from [6 . Let (X„) be a sequence 
of real integrable random variables, adapted to a filtration (Un), and let 

Zn = E[Xn+l I l^n)- 

Lemma 8. IfJ2n''^^^^(-^n) < o^ "■''^d Zn —> Z, for some real random variable 
Z , then 



^k a.s. y 



1 " 

— } Xk ^^ Z and n} 

n ^-^ ^-^ K^ 

fc— 1 A:>n 

Proof. This is exactly Lemma 2 of [6]. D 

Theorem 9. Suppose (X^) is uniformly integrable and 

(j) n^E{{EiZn+i I Un) -Znf}^0. 

Then, Z„ ^-4 Z and (l/n) X]fe=i -^k ~~> Z for some real random variable Z . 
Moreover, 

^l-^Xk-z\ — >JV{0,U + V) stably 

k=l 

for some real random variables U and V , provided 

(jj) .B{supj.>i Tfc |Zfc_i - Zk\} < oo, 

(JJJ) T. ELil^fe - Zk-i + k{Zr,-i - Zu)Y A U, 

(jv) nY.u>n{Zk-i-Zuf^V. 

Proof. First note that {Zn) is a quasi-martingale because of (j) and it is uniformly 
integrable for (X^) is uniformly integrable. Hence, Z„ —^ Z. By Lemma [H one 
also obtains (1/n) Y^^=i^k —^ Z. Next, assume conditions (jj)-(jjj)-(jv). By 
Theorem 1 of [B] (and the subsequent remarks) it is enough to show that 

^/TlE{s^^'p\Zk-l - Zk\ 1^0 and -^E{ max k \Zk-i - Z^,\} ~-^ 0. 

Let Dk = \Zk-i — Zk\. Because of (jv), 

nDl = nY^Dl--^^in + l) Y. DI ^ 0. 

k>n k>n+l 

Thus supj.>„ ykDk ~— 4 0, and condition (jj) implies 

^/^E{s^x{>Dk] < E{anpVkDk} — ^ 0. 

k>7i k>n 

Further, for 1 < m < n, one obtains 

E\ max kDh] < E\ max kDh] + \/nE\ max Vk Dh] 

'-l<k<n ' 4<fc<m -' ''m<k<n ' 

<E{ max kDk} +V'nE {sup VkDk}. 

l<k<m k>m 
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Hence, it suffices to note that 

limsup^^-EJ max kD^.} < -Ej sup \/kDf;\ for all m, 

and liinm _E{supj.>„ ^/kD^} =0. D 

Note that condition (j) is automatically true in case (X„) is c.i.d. with respect 
to the filtration iUn)- We are now able to prove Theorem [T] 

Proof of Theorem [7l We apply Theorem [9] with Xn = Kn and Un = J^n- Let 

T,t=i Rt + c 

Note that 

Etl R^ T..^S,. M,{X} -(3Ln _ j:tl R^K, -P^n 



XI "^"(^ 



Er=i -^^ + c ELi ^* + c 



and recall the notation 

Qn = ^11 y (7{Rn+l) = cr(Afi,...,M„,i?i,...,i?„,i?„+i). 

Uniform integrability of (K^). It suffices to show that sup„i?{e*^"} < oo 
for some i > 0. Let g{t) = e* — 1 and 

Arguing as in Lemma [1] and since A„ < Dn"^^ for some constant D, one obtains 

E{e*''-^' |^„} =e3W^" n {l+gW^n(^)} <exp{g(i)A„+5W E •^«(^)} 

xes„ xeSn 

Hence, it is enough to prove that sup„ i?{e*'^"} < oo for some t > 0. Define 
Un — „„"|^p ■ Since L/„ is tj„-measurable, 

£;(e*^"+i) = i?{exp [tWn (1 - Un)) ii;(e*^"^"+i | g„)} 

< £;{exp ( ^^f_^"^ ) exp (i W„ + (g(t f/„) - i C/„) V7„) }. 

Since [/„ < h/{nu) and W^i = /iTi ^ Poi(a), it follows that 

i?(e*^"+0 <a„(t)i?(e''"W'^^) < oo 

for suitable constants a„(i) and &„(*)■ Thus, i?(e*^") < oo for all n > 1 and i > 0. 
Observe now that g{z) <2z and g{z) — z < z^ for z € [0, 1/2]. Since f/„ < b/{nu), 
thentUn < 1/2 for n > {2bt)/u. Hence, if t e (0, 1] and n > {2b) /u, then 



(6) ij(e*^..+^) <exp{±ii^^}i?{exp(tP^„ + (iC/j2M^„)} 

< exp {^} i?{exp (i Wn (1 + ^)) } 
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where D* = max{2£' (b/u), (6/w)^}. Take t and no such that 
tG (0,1/2], no>^, 11(1 + ^)^2. 

j>no 

Iterating inequahty ([6]), one finally obtains 

£;(e*^"+0 < exp { 5] ^^} i?(e2*^"o) for each n > no- 

This proves the uniform integrability of (K^). 

We now turn to condition (j). Since Af„+i | J>j ^ BePivn), 

Z„ - E{Kr,+i I Tn) = iyn{X) = A„ + ^ J„(a;) - A„ + '^n ' p ^ " ■ 
On noting that L„ = i„_i + iV„, a simple calculation yields 

Rn{Kn — Zn^i) + i?„A„_i — /3Nn 



Zn — Zn-l — A„ — A„_i + 



Condition (j). Since Rn+i is independent of (Mi, . . . , M„, M„+i, i?i, . . . , i?„), 
E{Kn+i I g„) = ^(if„+i I Tn) = Zn and E{Nn+i I e„) = ^(iV„+l I Tn) = A„ a.s.. 
It follows that 

EyZn+l — Zn I J^n) = -B|i?(Z„+i — Z„ | CJ„j | Fn) 

-i^l A„+i - A„ + -— q^j— — — I y-„| a.s.. 
Hence, 

E{{E{Zn+i I J:,0 - z„)'} = £;{^(^„+i - ^„ I Fnf} 

u^ n 

By Lemma H £;{(A„+i - A„)2} = 0(n2/3-4) and ^(A^) = 0(^2/5-2). Hence, 
condition (j) follows from /3 < 1/2 (or equivalently 4 — 2/3 > 3). 

Having proved condition (j) and {K"^) uniformly integrable. Theorem [5] yields 
Zn -^ Z and Kn ^^ Z for some Z. We next prove (1/n) Y^=i ^f ~^ Q ^^^ 
some Q such that Q > Z^ a.s.. Recall that 

sup£'(i<r^) < — supi?(e*^") < oo for a suitable t > 0. 

n t n 

Hence, by Lemma El (l/n) J27=i ^^i ~^ Q Provided E[K'^^^i \ Fn) -^ Q- 

Almost sure convergence of E[K^_^i \ Fn) and Q > Z^ a.s.. Letting 
Gn = X^xss Jn{x)'^, it is uot hard to see that 

Thus, since Z„ — >■ Z and {Gn) is uniformly integrable, it suffices to prove that 
{Gn) is a sub-martingale with respect to {Qn)- 
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Let US define the random variables {Tn,r : n, r > 1}, with values in X U {oo}, as 
follows. For n — 1, let Ti,r = oo for r > Li. If Li > 0, define Ti^i, . . . , Ti^Li to be 
the dishes tried by customer 1. By induction, at step n > 2, let 

Tn,r = Tn-i,r for \ < r < i„_i and Tn^r = oo for r > Ln- 

If Ln > Ln-1, define r„.L„_i+i, • ■ • ,Tn.L„ to be the dishes tried for the first time 
by customer n. Then, cr(r„.j.) C Gn for all r > 1. Letting J„(oo) = 0, one also 
obtains 

^n / ^ '-'nyJ-n.rf • 

r 

For fixed r, since a(Tn^r) C ^n, it follows that 

E{Jn+liTn,r) \ Gn} = ^{^{r<L„} Jn+l(Tn^r) \ Gn} 

~(i + ELl R^M^{Tn,r} + i?„+lS{M„+l {T„,,} | Gn} 
- Hr<L„} , ^„+l 



C ^ 



Y::^R^ 



_ J Jn{Tn,r) {c + ^i=i Rj} + Rn+lJn{Tn,r) _ j /rp \ 

— -'{r<L,.} , v^n+1 „ — •Jn{-Ln,T) a.S.. 

Then, 

E{Gn+l \Gn}^ e\^ Jn+l{Tn+l,r? I Gn] > E {j^ Jn+l{Tn,rf I Gn] 

r r 

= Y,E{Jn+l{Tn.rf \ Gn} > ^ S{ J„+l (T„,,) | Gn}^ = JI ^» (^",')' = ^n a.S.. 
r r r 

Therefore, (G„) is a (tj„)-sub- martingale, as required. 

From now on, Q denotes a real random variable satisfying 

n 

E{Kl+,\Fn)^Q and -Y^KJ ^ Q. 

i=l 

Let us prove Q > Z^ a.s.. Let y„ = Jn{Tns)- Since (Fn) is a [0, l]-valued sub- 
martingale with respect to {Gn), one obtains y„ ^-4 Y for some random variable 
Y. Thus, 

Q - Z2 =. lim{S(A;2 I j;„) - Z2} = lim{Z„ - G„} 

= lim {A„ + Y, Jn{Tn,r) (l " Jn{Tn.r)) } 

r 

>iimy„(i-r„)-r(i-r) a.s.. 

n 

Since L„ — ^ oo (because of Theorem 2]) then T„^i ^ oo eventually a.s.. Hence, 
arguing as in [2] and [17] (see also Subsection 4.3 of [6]) it can be shown that Y has 
a diffuse distribution. Therefore, <Y < 1 and Q - Z'^ >Y {1 -Y) > a.s.. 

We finally turn to conditions (jj)-(jjj)-(jv). 

Condition (jj). Since E{Z^^^) = E{E{Kn \ J^„-i)'^} < E{K^), then 

sup ii;{i^;t + z4_i + At_, + K}<2 sup e{k:^ + ki_, + iv^} < oo. 
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Therefore, 

oo 



<Di }^n |^|(A„-A„_i) 1 + -3 j 

n— 1 

n=l 

where Di and -D2 are suitable constants. 

In order to prove (jjj)-(jv), we let 

2 
U^^^{Q-Z^) and V = ^ {Q - Z^). 






Condition (jjj). Let X„ — {Kn — Zn-\ + n{Zn-i ~ Zn)} ■ On noting that 
^„n^£'{(Z„ — Zn-i)^} < 00, as shown in (jj), one obtains ^nn''^E{X^) < cx). 
Thus, by Lemma[51 it suffices to prove i?(X„ | Tn-i) -~-^ U. To this end, we ffi^st 
note that 

E{iKn - Z„_i)2 I J-„_i} ^ E{K^ I Tn-i} - Zl_^ ^Q-Z^. 

We next prove 

In fact, 

^^^ Rl{K^-z^_,f , ^ ^ , ii;{ii;^(ir„-z„_i)^| j-„_i} 






ri-1 n \2 



(E:Ui?. + c)^ ' '-'i- (Et'R^ 



(^^) 

^n-1'^ 



2E{Rl)E{{Kn-Zn^l)^\Tn-l} a.s. Q (Q ^ Z 



2\ 



[Rn-l] 



= V. 



n tj< ,^^„ -t:- y-„_i > > n : t^ > V. 



Similarly, since i?„ < b 

rBl{K^-Z^^_^ ^ 

This proves condition (*) while (**) can be shown by exactly the same argument. 
Finally, it is not hard to see that 

n'E{{Z^^, Z^f I ^„_i} - n^g| ^^f"~^"-;f | .F„-i| ^ 0, 

nii;{(i^„ - Z„_i) (Z„_i - Z„) I j-„_i} + nii;| ^"^^"~/7^^' I J-„_i| ^ 0. 

Therefore, n2^{(Z„_i - Z,,f \ Fn-i] "^ V and 

2nE{iKr, - Zn-i) {Z^-x - Z„) | J-„_i} ^ -2 (Q - Z^), 
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which in turn implies E{Xn \ J"„-i) "^ V - {Q - Z^) = U. 

Condition (jv). Let X,, = n^Z„-Zn-i)\ Since ^„ n2^{(Z„-Z„_i)4} < oo 
and n^E\^{Zn-i — Zn)"^ \ J^n-i} -^ V, as shown in (jj) and (jjj), Lemma E] yields 

Xk a.s, 



nj:(Z,_,-Z,f=nj:^^V. 



k>n k>7i 



In view of Theorem [HI this concludes the proof. 



D 
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